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ABSTRACT 


A rotation sampling plan to estimate hog inventories is presented 
and a composite double sampling regression or ratio estimator is 
compared to a single time direct expansion estimator. 


Key words: Rotation sampling, successive sampling, panel studies, 
regression or ratio estimation, change estimation, double sampling. 
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SUMMARY 

An analysis of different rotation designs to be used with the 
list frame in the Multiple Frame Quarterly Hog Survey is presented and 
analyzed. The rotation design discussed deals only with a four-quarter 
survey period and does not address between year rotation plans. A com- 
bined ratio estimator, separate ratio, and separate regression estima- 
tor are compared to the direct expansion estimator. 

A rotation design where 50 percent of the sample units are matched 
between quarters is suggested from the analysis. The rotation plan 


suggested is: 


Dec March June Sept 
$1 $1 
So So 
$4 S4 
S S 
4 4 


where S5 is a replication of size 1/2 the stratum allocation. 
This design allows for estimation of current inventory with the 
capability of more reliable estimates of change to be developed. The 


design also reduces respondent burden to two contacts per year in nearly 


all strata. 


INTRODUCTION 

Current Sartre: Currently, ESCS uses several different rotation 
sampling plans for the list frame in the Hog Multiple Frame Survey. All 
rotation plans discussed in this paper deal with sampling within a year for 
quarterly estimates in December, March, June and September. This list frame 
is updated annually before selection of the sample(s) to be used for the 
quarterly surveys. The problem of rotating the sample between years given 
an annual update is reserved for a later paper. 

Figure 1 gives the types of rotation plans currently used within strata 
by states on the quarterly hog survey. In the strata below extreme operators, 
several states select one sample from their list for use during the entire 
survey period of four quarterly surveys. This gives 100 percent overlap 
between quarters as illustrated by Plan 1 in Figure 1. In other states a 
new sample is selected midway within the year and there is no overlap in the 
sample between quarters (Plan 2, Figure 1). Note that Plan 3 is a simple 
variation of Plan 2, but an individual operator would be contacted at six-month 
intervals. In some E.0O. strata, ESCS has a policy of no overlap between ea 
periods. This rotation design (Plan 4) minimizes the respondent burden for 
larger operations to one contact for an entire year. Each state may use a 
combination of each of the above plans. 

Figure 1: Current Rotation Plans used on ESCS Hog Quarterly Survey 


Plan 1: One Sample, held in 4 times, 100% overlap (OL) between periods 


Dec March June Sept 
S S S S 


iL UL 1 1 


Plan 2: Two samples, each held in two periods, 100% OL between 


Dec/March, 0% OL between March/June 


Dec March June Sept 
Sy 7 
So Ss, 


Plan 3: Two samples, each held in two periods, 100% OL between 


Dec/June 0% OL between Dec/March, March/June, June/Sept 


Dec March June Sept 
Sy Sy 
So So 


Plan 4: Four samples, 0% OL between survey periods 


Dec March June Sept 
sil 
23 
eh 


Table Al in Appendix A gives the 1978 rotation plan for the 14 


hog multiple frame states. It can be seen in Table Al that a multi- 
tude of different rotation plans are being used in the quarterly hog 
survey. No consistent overall policy on rotation for the quarterly 
hog surveys exists at present. 

Estimation of Change: The use of successive sample surveys of 
the same population provides flexibility in choosing a sampling design. 
Different estimators can be utilized to supplement a direct expansion 


estimate based only on current survey data. Repetitive sampling 


ed yd 

provides the necessary data to form eerinaces based on change. Hetie 
mates of survey to survey changes are made by matching reports from 
the same sampling unit surveyed in two or more successive 

surveys. 

Using matched reports as a basis for estimation has a long his- 
tory with ESCS -- Statistics. This approach continues to be heavily 
relied upon for nonprobability surveys. An indication called the 
"current/current" ratio is calculated and applied to previous sur- 
vey estimates to provide a current survey estimate. A variation of 
the "current/current" indication is the "current/historic" ratio. 

The latter indication also measures change from a previous survey to 
the current period, but the data for the prior period are collected on 
the current questionnaire. The advantage is that no prior survey 
needs to be conducted, therefore, reducing respondent burden. This 
would be particularly useful when a statistical series is being 
initiated since prior data for sample units in the current survey would 
not be available. For an ongoing program, however, it has been found 
that data reported for the preceding periodare often subject to memory 
bias [4]. 

For probability surveys, estimates can be made without prior sur- 
vey data for matched sample units. Direct expansion estimates have 
been given primary emphasis for our June Enumerative and Multiple Frame 
Livestock survey. This does not mean that one should ignore available 
prior information. 

Ratios and ratio estimators have been computed based on matched 


segments in the June and December Enumerative surveys. Extreme 
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operators have been excluded from the matching procedure. The esti- 
mated sampling error associated with the ratio estimators has been about 
the same as those associated with the direct expansion estimate. There- 
fore, the ratio estimator has not been emphasized. No attempt has been 
made to form a composite estimate based on combining the ratio estimate 
from matched segments with a direct expansion for the unmatched "rotated 
in' segments. 

Little use has been made of matching reports from multiple frame 
surveys, even though the same sample unit is often surveyed in two or 
more successive surveys. Cochran [4, p. 342] states that if we wish to 
maximize precision, the following can be said about replacement policy: 

"1. For estimating change, it is best to retain the same sample 

throughout all occasions. 

2. For estimating the average over all occasions, it is best to 
draw a new sample on each occasion. 

3. For current estimates, equal precision is obtained either 
by keeping the same sample or by changing it on every 
occasion. Replacement of part of the sample on each occa- 
sion may be better than these alternatives." 

Since we are not interested in estimating the average number of 
hogs in a calendar year, statement two above does not apply to our 
situation. We will show that a partial rotation plan of 
approximately 50% matched units between occasions is well suited for 


estimation of hog inventories. 


THE ROTATION PLAN 


Use of successive sampling or rotation sampling 


in surveys has been investigated by many authors for several types of 
surveys [1,2,5,6,8,10,11,12,14]. Gleason and Tortora[8] have extended 
the successive sampling theory for use with two overlapping sampling 
frames, as in ESCS livestock multiple frame surveys. In the analysis 
which follows three different successive sampling estimators will be 
compared in terms of precision to the direct expansion estimate on a 
stratum by stratum basis and then for estimation of the population 
total for the list. The estimators are: 

1. The ratio estimate within strata. 

2. The regression estimate within strata. 

3. The combined ratio estimate over all strata. 

Appendix B gives the appropriate mathematical and statistical 
expressions for the above estimators and their variances. 

The list portion of the multiple frame data file was reformatted, 
and merged together by stratum and operator ID for each data set. An 
approximate 50% sample within strata was selected from all operators 
who responded in both survey periods. The sample was selected to 
simulate a rotation design of approximately 50% matching of units 
between survey periods. This matching will allow us to develop ratio 
and regression estimators and estimate optimum matching percentage 
between survey periods. Table A3 in Appendix A gives the number of 


matched samples selected by stratum and state. 


Figure 2 below depicts the rotation design which will be investigated. 


Figure 2 
Dec March June Sept 
2 4 
e 25 
23 ”3 
4 Br 


S is a random sample of size % of those who responded both times. 

Note that for this rotation design only two contacts of each operation 
per year is made and this design permits composite estimation using matched 
and unmatched reports. 

In the analysis sections which follow we will investigate the list frame 
strata in the following states and survey periods: 

Iowa 1978 December, March 

Iowa 1976 June, September 

Minnesota 1977 June, September 


Kansas 1977 December, March 


PORTION OF SAMPLE TO BE RETAINED 

One of the major considerations in an ongoing survey is what percentage 
of the sample should be retained from one survey period to the next. Since 
the quarterly hog survey is a stratified design the optimum percentage to 
match varies by stratum. Table 1 gives the optimum percent to match for each 
data set analyzed. The variable considered was "total hogs and pigs." The 


appropriate mathematical expression for estimation of the optimum matching 


oS tee 


Table 1: Correlation coefficients between survey periods for total hogs and 
optimum percentage to match, by state, stratum. 


Iowa Dec/March 1978 Iowa June/Sept 1976 
enya Correl Optimum 7 tO Match Sera tue Correl Optimum % to Match 
Coeii = res est ratio jest Coeff rep est ratio ene 
a 70 49 24 3 a0 ~42 43 
3 sche Sel Fa), 4 Ale) 4 oe) 41 
4 oid 41 eae é) he a6) ~44 
5 rol 41 ~42 6 84 237 . 38 
6 -67 44 44 7 . 86 . 36 36 
806 ALS} aad! 20 
Minnesota June/Sept 1977 Kansas Dec/Mar 1977 
Stratum Core! Optimum % of Match "tym Corel Optimum 2 of Match 
Coeff reg est ratio est Coeff reg est ratio est 
11 Behe) 06 14 10 A535) 34 Fae }) 
12 96 yas ee 12 80 ue | ee) 
Sle eee 47 54 14 5 40 40 
61 708 34 2o/ LD 74 41 42 
62 a19 237 Shas 16 she) 42 44 
63 70) - 36 700 
64 74 41 42 


65 82 -40 41 






































percentage for both a regression or ratio estimator is provided in Appendix B. 


The optimum percentage for the different types of estimators is a function of 


the correlation coefficient between survey periods for the same variable. 


The optimum percentage to match is very similar for the regression and 
the ratio estimates. The optimum percentage to match for both estimators is 
generally between 30 and 50 percent. Fourty-one (41) percent is the median 
of the percentages to optimally match over all states and strata analyzed. 
Therefore, assuming an overall 80 percent response rate, 50 percent of the 
units should be retained between survey periods by stratum to operationally 
have a 40 percent matched sample between survey periods. 

Also, inspecting Table 1 for Iowa we see that stratum correlation 
coefficients do not remain constant between years or different survey periods. 
It is planned to investigate this further for an entire four quarter survey 
period. 

In the next section we will compare the various estimators with respect 


to their variances. 


COMPARISON OF ESTIMATORS 

The three composite estimators which are to be compared to the direct 
expansion estimator are presented in Appendix B. Basically the composite 
estimator is of the form: 


Y =W be + (1 - W) uae (1) 


“a 


where Yn is a ratio or regression estimate of the total from the match 


a 


portion of the sample, oe is an estimate of the total from the unmatched 


1 (yee 


portion of the sample and W is a weight used to combine the two independent 
estimates. ya can be a combined estimate over all strata which have matching 
units or might consist of separate ratio or regression estimators for each 
stratum. The mean/unit, regression and ratio estimates for each state and 
stratum appear in Table A4 in Appendix A. 

Table 2 gives the relationship between estimates of the total and the 
coefficient of variation for each estimator. Note that the direct expansion 
estimate is considered as the base estimate with each of the other estimators 
written as a percentage of the direct expansion. In most cases the composite 


estimators are within 2 percent of the direct expansion except for the combined © 


ratio estimate in Minnesota, 





eee eae 
Table 2: Estimate of total and for direct expansion, regression, separate 
ratio and combined ratio estimators 


Separate Combined Combine 
Deke DeteumRegr, Ree Ratio Z i. of Ratio 
State Base % CoVemecob. Dik. CEV..  OL_ DIE. CRY ae ee ate 
(4) (%) (000) (%) (000) (%) (000) 

Iowa March 

(1978) 100.0 Sie) 100.0 hey? 101.0 Sea 100.0 ee! 
Iowa Sept 

(1976) 100.0 3.0 10171 Shas) LOLS hg * * 
Minnesota Sept 

(1977) Pe 100.0 pee avis Se his 3.8 101.8 3.8 
Kansas March 

x 100.0 1174 100.9 41.3 101.25-5 lle? 4. 4o1eo meee 


CL O:7) 


* 
not available 
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Table 2 also shows that the precision of the estimate was slightly 
improved when using a regression or ratio estimate over a direct expansion 
estimate. In addition, the successive sampling estimators may be more stable 
since a portion of the sample is retained from quarter to quarter and the 
estimates are linked through the use of the composite estimator. 

Table 3 shows in percentage terms the gain in precision on the variance 
of the respective estimators over the direct expansion. Gains are modest in 
most cases. Little gain was realized for Kansas since the composite estimators 
were used only in strata 14, 15, and 16, and direct expansion estimates in the 


other strata. 


Table 3: Gain in precision (variance of the estimate) for regression, ratio, 
and combined ratio estimators over the direct expansion estimator 


Gain Reg Gain Separate Gain Combined 
State Over D.E. Ratio Over D.E. Ratio Over D.E. 
8 ve v/s 
Iowa March 
; 6.2 a 
(1978) — 
Iowa Sept 
Ze 12.0 * 
(1976) bert 
Minnesota TAP RERS) i Wr pg 16.0 
Kansas A a — 


ade available 

Note that in this development we were interested only in current estimates 
of level. Efficient estimators of quarter to quarter change should also be 
investigated. Other government agencies are already using rotation designs to 
estimate current level and change for their operational data series [3,9,15,16]. 
However, no optimal solution exists to estimate both change and level 


simultaneously. Further theoretical research needs to be done to attack this 


ut oe 


important survey problem. This problem can be simply stated as follows: 


% 
‘ 


The optimum estimator of change added to the previous quarter's estimate does 
not necessarily equal the best estimate of the current quarter's level. 

One of the most important aspects of using a composite estimator is 
choosing the weight (W) to combine the matched and unmatched portion of the 
sample for the composite estimator. Good sampling design practice dictates 
that this quantity should be fixed prior to estimation and not estimated from 
the sample used [4]. However, when several characteritics are being estimated 


such as total hogs, farrowings and breeding stock, etc., these weights are 





going to vary for each characteristic and list frame stratum. It would be 

extremely difficult to estimate an optimum weight for each characteristic and 
stratum. Therefore, it is desirable to fix W prior to conducting the survey. 
The range of optimum weights for each type of estimator as computed from the 


surveys studied may be seen in Table A6 in Appendix A; in most cases 


the weight is nearly one-half. Based on these results and the fact 
that any weight W provides an unbiased estimator, it is recommended that a 
weight of .50 be used to combine the unmatched and matched portion of the 
sample in a composite regression or ratio estimator. 

A question also arises for the regression estimator about the values of 
the regression coefficients Bo and By and their estimation from the matched 
sample. The BS coefficient varies considerably by strata and no definite 
fixed value can be suggested from the analysis. The BS term in the estimated 
regression does not vary as much as the BS term, but fixing a value for 
estimation over all strata for all characterstics at this point is a bit 


premature. However, Cochran [4, p. 191] shows that such a fixed By value 
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leads to an unbiased estimate of the mean. Since the estimates of Pn and By 
varied considerably by strata it was felt that a combined regression estimate 
would not be appropriate in thissitutation [4, p. 203 and 204] . The dangers 
of bias with small sample sizes (as in Minnesota Strata 11) may also put the 
separate regression estimate into an unfavorable category. 

It seems clear that if one is estimating the paramters of the model 

y=Bo +B, x 

where y is the number of hogs for the current quarter and x is the previous 
quarter's total hogs that Bo should be o and By would be just a percentage 
decrease or increase from the previous month's level, i.e., a ratio estimate 
would be appropriate. Inspecting Table A/7 in Appendix A we see that for most 
state/stratum combinations that we do not reject the null hypothesis that 
B =o ata= .05. Hence it appears for most strata PHAt a separate ratio 
estimator as opposed to a regression estimator is an appropriate model for 
estimation of current level. 

Since each of the estimators attain similar precision on estimation of 


the total no clearcut choice of the regression, separate ratio or combined 





ratio estimators can be made at this time. However, we recommend computing 
all three estimators and comparing them to the direct expansion estimator. 
This comparison should be made on both level and precision. These additional 
estimators may provide extremely valuable information in off quarter analysis 
(March, September) since only the multiple frame estimate is computed and 


charted for these quarters. 
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FUTURE STUDY 


The proposed rotation plan suggested has several attractive features which 


may provide the necessary framework to retain a portion of the sample even 
after list update (usually in November). The "adds'' and "deletes" from the 
list could be sampled as well as operations that change between strata. These 
components of a more complex composite successive sampling estimator could 
then be used to make estimates of quarter to quarter change after an update 

to the list. Of course this type of estimation would require a complica 
historical data base system, as is currently being designed into the List 
Sampling Frame. 

Another possibility, to lessen the potential impact on the U.S. indication 
of an entirely new sample across all states for the December survey is to 
rotate the time of update for individual states. Possibly one fourth of the 
states should update their list prior to each quarter. 

Further research into composite estimators similar in form to ones used 
by the Census Bureau [9,15,16] in their monthly retail trade surveys is recom- 
mended. The research conducted in this study investigated three frequently 
used successive sampling estimators. No estimator clearly distinquished 
itself as "best" in this study. However, all estimators were more precise 


than the direct expansion estimator. 





, 
: 
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CONCLUSIONS/ RECOMMENDATIONS 

1) With the proposed rotation sampling plan, all strata (except the 
largest E.0.) are rotated uniformly between quarters. This will promote 
consistency in the estimates over time at both the state and national level. 
It will end the variety of rotation plans currently being used incthe Agency 
(see Table Al). We recommend that 50 percent of the units be retained 
between quarters and that all states in the 14 hog multiple frame states be 


put on this rotation plan. The estimators also provide additional 


information in the critical March and September survey periods when only the 
multiple frame estimate is computed. 

2) Quantitative pesssape burden is limited to a contacts per ears 
instead of the up to four contacts in some strata, with guaranteed overlap 
for part of the sample in three of the four quarters. This takes advantage 
of the high correlation in survey data between quarters so that everyone from 
the previous quarter need not be recontacted. 

3) Double sampling regression or ratio estimators provide composite 
estimates with similar or smaller variance than the direct expansion estimate. 
The sample design still allows for direct expansion estimates to be computed 
with the same summary procedures while also permitting the computation of 
composite estimates for comparison purposes. A weight of one-half should be 


used to combine the matched and unmatched portions of the composite estimates. 


10. 


Section, ASA, 1976, Boston, Mass. 
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Appendix A 


Table Al; List Stratum Rotation Plans by State - Hog M.F. 1978 


State 


Georgia 
Illinois 


Indiana 


Iowa 

Kansas 
Kentucky 
Minnesota 
Missouri 
Nebraska 

N. Carolina 
Ohio 

S. Dakota 


Texas 


Wisconsin 


Plane 
One Sample, 


4 Contacts 


Stratum No. 


81 - 84 

98 

84 — 87 

82) = 86, 94 
B81. 82 

All Strata 
81 - 88 

81 - 83 
86, 88 

81 - 85 

81 - 88 


Bae 


Plan 2 


Two Samples, 
2, Contacts 
Stratum No. 


95 

81 - 86, 93 
94 

94 

95 

84 - 86, 94 
81 - 85 

95 

All Strata 
AlLi-Strata 


Plan 3 
Two Samples, 
2 Contacts 
Stratum No. 


SIGs 9S 


Plan 4 
Four Samples, 
1 Contact 
Stratum No. 
93 - 94 
81 = 835095 
83 - 93 
93 - 94 
93 
95 
93 - 94 
93 


All Strata 


ee 


Table A2: Number of samples by survey period and number of merged records by 
survey period. 


Iowa Dec/March 1978 






% OL Between 
Survey Periods 


Sample Size 






Stratum Description 


Ay No Hogs 395 396 0 
Z 1-199 Phe 272 100 
3 200-399 220 Zoe oh 
4 400-599 220 220 100 
5 600-999 269 2/3 99 
6 1000-1999 167 72 99 
805 2000-3999 ikgal Li 0 
806 4000-9999 78 78 100 
905 10,000+ 9 9 100 


Jeti eee eee. EEnEEnGEEEEEEEEEERER 


Iowa June/September 1976 














% OL Between 
Survey Periods 


Sample Size 





Stratum Description 


My No Livestock 303 304 0 
2 No Hogs 223 222 0 
3 1-199 239 Zo 100 
4 200-349 225 225 100 
5 350-599 283 283 100 
6 600-999 225 220 100 
ZL 1000-1999 138 138 100 
85 2000-8499 113 113 0 
95 85+ 20 aM CO) 


Sn Ones ee 
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Minnesota June/Sept 1977 











% OL Between 
Survey Periods 


Sample Size 


Description Tine Sept 


Stratum 


ed No Info 68 68 100 
12 No Livestock 358 358 100 
31 No Hogs 228 228 100 
61 1-74 362 362 100 
62 75-149 213 213 100 
63 150-299 208 208 100 
64 300-399 255 gas) 100 
65 400-699 141 141 100 
85 700-999 80 81 0 
86 1000-2499 74 1s 0 
ye 2500+ 14 14 100 
1/ 


Kansas Dec/March 1977 — 









% OL Between 
Survey Periods 


Sample Size 


Dec March 







Stratum Description 






14 ae) 348 348 100 
15 100-199 269 269 100 
16 200-599 ome: 233 100 
85 600-999 88 81 0 
86 1000-999 64 64 0 
95 4000-- 24 24 100 
Li ‘ape 


— Kansas, stratum 10 and 12, subsample 0's from the December Survey and used 
all positives from December in the March Sample. They actually formulate two 
strata (11 and 13) in March. For this analysis we used just the comparable 

strata. 





+S 


n 
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Table A3: Matched sample sizes for all data sets in the analysis 











Iowa Dec./March 1978 Iowa June/Sept 1976 
NO seUniEsS No. Units 
Responding Matched Sample Responding Match Sample 
Stratum Both Times Size Stratum Both Times Size 
1 0 0 1 0 0 
2 209 1 163) 2 0 0 
| 145 74 B 188 94 
4 138 Vie. 4 162 84 
5 184 97 3] 170 87 
6 108 56 6 Loy. 74 
805 0 0 7 86 oe 
806 69 36 85 0 0 
905 9 Ox 95 ae O* 
Minnesota June/Sept 1977 Kansas Dec/Mar 1977 
No. Units No. Units 
Responding Matched Sample Responding Match Sample 
Stratum Both Times Size Stratum Both Times Size 
11 63 ae 14 263 137 
Le 332 NeW 15 186 98 
61 306 [53 85 0 0 
62 207 96 86 0 0 
63 143 69 95 a4 Ox 
64 UTA 47 
65 92 50 
85 0 0) 
86 0 O* 
95 14 O* 


* Strata with 100 percent coverage was not subsampled 
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Table A4 ; Mean/Unit, Regression and Ratio Estimates 


Iowa Dec/March 1978 
Stratum Mean/Unit Regression Ratio 


2 G92 49.42 52365 
3 130599 13200 134.41 
4 214.65 ZED a1 2 213.64 
2) 359.07 $3859 358.62 
6 534.34 DZ6.09 aye a! 
806 1919.47 Tozi 305 1806.09 


Minnesota June/Sept 1977 
Stratum Mean/Unit Regression Ratio 


Li 4.03 Ze99 2.20D 
12 5.18 5.46 erate! 
31 Lee. 2019 Pe Ns 
61 ZO 206 24536 Zi10 
62 62.15 61.38 GsaZ 
63 Loi, 20 161.63 161.84 
64 226802 232800 oes Nek) 
65 388.69 399.48 402.02 


Iowa June/Sept 1976 
Stratum Mean/Unit Regression Ratio 


<i 


3 47.74 47.84 48.00 
4 136.06 136.07 135.62 
5 259 uo 244.58 246.26 © 
6 343.26 356.18 360.20 
fi 546.17 35 Oo 538.34 


Kansas Dec/March 1977 


Stratum Mean/Unit Regression Ratio 7 


14 45.27 47.54 48.10 
15 £22 51 P2306 126326 


16 274.61 278.02 280.87 


bt 


= 23 2 


Table A5: Gain in precision (in %) of regression and ratio over the direct expansion 


estimate. 


Iowa Dec/March 1978 


Gain Reg 


Gain Ratio 


Stratum Over Mean Over Mean 


Z 


3 


806 


4.1 


3.6 


28.2 


1533 


-5.4 


-1.5 


22.8 


2659 


14.5 


34.1 


Gain Reg 
Over Ratio 


10 


5 


ab 


+2 


es 


4 





Minnesota June/Sept 1977 


Gain Reg 


Gain Ratio 


Stratum Over Mean Over Mean 


as 


12 


31 


61 


62 


50.4 


44.7 


9 


47.9 


44.7 


Gain Reg 
Over Ratio 


1 


0. 


LD - 


/ 


0 


Iowa June/Sept 1971 


Gain Reg Gain Ratio Gain Reg 
Stratum Over Mean Over Mean Over Ratio 


3 16.8 Loa Loe 
4 2350 ZO 2 2.4 
5 21 <2 19.6 1.4 
6 30.6 (29.3 1.0 
7 30.7 30.4 Zz 





Kansas Dec/March 1977 


Gain Reg Gain Ratio Gain Reg 
Stratum Over Mean Over Mean Over Ratio 


14 | 2075 bs hg 4 
15 1979 19.0 oh 


16 1929 a a Lae 


me GFA 








Table a6: Statistics for Comparisonef Weights to Combine Matched and Unmatched Estimates 
of Totals : 
Weights , 
Matched Portion 
of Estimator 
; Ratio of 
Matched ae ade akbar Separate Combined Separate 
Sample Correlation cy /cv N ie Ratio Ratio Regressio 
Stratum Size Coefficient ibe re eee Weight Weight Weight 
M fe) A 8 
IOWA DEC/MAR 78 
2 115 - 38 1.01 1.02 47 -63 apy 
3 74 - 36 -80 1.05 43 63 46 
4 75 me 83 1.06 56 -63 56 
5 97 81 .97 1.11 59 63 60 
6 56 67 81 1.04 53 203 *.53 
8 
06 36 293 1.14 99 62 63 63 
KANSAS DEC/MARCH 77 
14 137 SHS 85 1.00 56 49 56 
15 98 74 88 1.02 56 49 56 
16 68 73 95 1.05 46 249 47 
MINNESOTA JUNE/SEPT 77 
il 31 99 akalb) 98 65 56 65 
12 157 96 95 99 62 56 62 
31 111 28 Reals) 96 243 56 51 
61 153 84 1.10 97 OPW! +56 58 
62 96 79 1.01 97 52 56 se) 
63 69 - 86 96 1.06 58 56 58 
64 47 74 97 1.02 48 56 -49 
65 50 83 95 1.11 62 56 62 


ee 


Table A7: Estimates of Regression Coefficients and t-Test of Hypothesis 
Hes 8. = 0 
O O 





Stratum Estimate of Parameters t-Test 


| B 8 Hatt & <= 0.00. =>. 
fe) il Oo O 


| Iowa Dec/Mar 1978 


2 ge 65 NS 
S. 61.70 ey) Significant 

4 39.40 .76 NS 
5 55.88 200 Significant 

6 122.03 .70 NS 
806 B91. RZ 282 Significant 


Kansas Dec/Mar 1977 


14 5/0 79 NS 
1 iby ent .76 NS 
16 63.70 ray ke’ Significant 


Minnesota June/Sept 1977 


ime ea 63 NS 
12 -.07 Fob NS 
aL 1.66 1.05 NS 
61 S235 .83 Significant 
62 BRS: olde NS 
63 le OD cod NS 
64 56.00 nog NS 
65 69.67 rod NS 


NS = Not Significant 


er) Gee 


Estimation Formulas for Regression, Ratio and Combined Ratio 
Estimate 


Appendix B: 
To develop this theory we summarize the notation as developed in Sen, et al. 


[13]. Within a list frame stratum let, 


mn, = total sample size on first occasion 

ny = total sample size on second occasion 

m = matched sample size of those operations who reported on both 
occasions. 

u, = sample size of those who responded only on the first time 

us = sample size of those who responded only on the second occasion, 
So, u, =n), -m and uy = 1, — M5 


Yrs total sample mean on first occasion 


rat total sample mean on second occasion 


ae = matched sample mean on first occasion 
Yom = matched sample mean on second occasion 
vay s unmatched sample mean on first occasion 
Soy = unmatched sample mean on second occasion 
2 : : ; ; 
Ss) = population variance on first occasion 
2 : ’ : 
55 = population variance on second occasion 


Ao a= (8,/Y,)/(S,/¥,)3 ratio of the C.V. on the first occasion to that 


on the second. 


p = correlation coefficient between yy and Yo: 
Z= A (2p - A) 

A= n/n, 

Q = n/n, 


—oge = 


So for example, n, is the total number of useable reports for a particular 
state and stratum. For purposes of this analysis we will ignore finite 
population corrections in comparing the regression, ratio and mean/unit estimates 


(direct expansion). 


Composite Ratio Estimation Within Strata 


The matched (m units) and unmatched units on second occasion provide 


independent estimates bys and oe of the population mean e on the second 


occasion. For the matched portion of the sample and improved estimate en of 


x, is the double sampling ratio estimate 


y 

-, 2m —_ 

Jom ins cs ¥y (1) 
Yim 


The variance of Yom? 


(n, - u, 4) 
-. ie 1 1 
V(¥5 ay 55 m n, (2). 


nn 


As an estimate of the variance we can replace ) and 4 by their appropriate 
sample estimates. Assuming A = 1 (C.V.'s between survey periods are equal. 
This is not an unreasonable assumption for the quarterly hog survey.), it is 
necessary for p > .5 for 58 to be a gain over i (Cochran p. 165). A ratio 
estimate, Yo, of the mean ¥3 of the population on the second occasion is 


given by combining the independent estimates Be and oe as 


Yor = Ww ‘oe + (1 - w) eae C3)% 
The best estimate of the mean “Gs is obtained by using values of w which 


minimize GG ake Taking the derivative of (3) with respect to w gives the 


value which minimizes V(y5)> which is 


w= Be ew) + Viy5 Ds (4) 


S582 


With this value for w the v(y,,) is 


52 {n, - u, 3} 
Yo i u 
n) ny - & uy uy 


VY? = (5) 


The gain of the ratio estimate over the mean/unit estimate (commonly called 


a direct expansion estimator) is 


G,= {v(y,) - V(¥n) WV y) + 
_ 2A (0 - 2) 
~ @-8% (0 - A) (6) 


The optimum percentage to match for the ratio estimate can be found by 


taking the derivative of CG) with respect to A. This gives 


r : Ov l= @ (7) 
Opty. abi0gs. a eee 
ees GPS 


To arrive at estimates of totals for the list frame each mean by strata 


was expanded by the population size N, and summed over all strata. Similar 


h 


computations were performed for variance estimation of the total. 


Composite Combined Ratio Estimation 
Instead of computing the estimator (1) on a stratum by stratum basis an 
alternative procedure is to compute a combined ratio over all strata where 


there were matched samples. To estimate the total on the second occasion this 


estimator becomes: 


A 


IN, y 
3. hee emb ae 
Voce We eee EN 


; 
h ~ 2uh 
Ne Yimh 


where the last subscript represents the h-th stratum, and 


Ny = population size in h-th stratun, 


Ee) Ors 


w is a weight which combines the matched and unmatched portion of 


the estimator, 


“a 


yy is the previous survey's direct expansion estimate of the total for 


all the matched strata. 


The variance of Yc is 


= cp aN S. -2R PL By ay 
< * 2 2h lh lh 2h 
Cytol et aa = ys 
a 
C2eRES S -R 5? )] we + (1 - we r N. oa /u, 
21h? 2h Yih 72h 
peat oo ea 
where R = TN and: 
Ny Yimh 
5° - 5? are the within stratum population variances on the foie and gag 
"ib »)e2h 


occasion, respectively. 


Composite Regression Estimation 


Another estimate of the mean/unit in each stratum is the regression 
estimate. Patterson [11] was the first to suggest such an estimate. The 
regression estimate can be written (as was the ratio) as a combination of the 
matched with the unmatched with an optimum weight t to minimize VG ren as 


follows: 


th 


y = y an S yo ey } tu, y 
reg {Yom : oy y (yy ake I inp a: 








(2 aye wl 


2 eee. 
where t = ives a2) (8) 
ik 


and u, is the unmatched sample size. 


2 





es 

The variance of Yoerae is | 
! 

(9) 


The gain of the regression estimate over the ratio estimate is, | 


G, = (V(y, oa AGES era UN AG IS ao) (10) 


and the gain of the regression over the mean per unit estimate is 








GE) CS SVR, Uy GY. ; 


G, is usually small except where A (the ratio of the C.V.'s) is large. 


For our particular application the gains (in variance) were not large for 





any state/stratum combination. The optimum percentage to match for the 


regression estimate is 


k 
d SNe dail (12). | 
BE ett Seale oat 


This generalizes Cochran's results [4] for unequal sample sizes on each ~ 


occasion. Generally the optimum percentage to match for the regression and 


the ratio estimate are not the same. However, for the quarterly hog survey 


the ratio of the C.V.'s between survey periods is nearly 1,(see Table A6 in 


Appendix A). Therefore, A = 1, implies # = 25-1 and 


opt, ratio becomes 


Sh 











Doeeetac.o ee PO ve ens 
It can be shown that Stay gst py pane re when A = 1, therefore, the ratio 


estimate always requires a larger matching percentage. 








